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ABSTRACT 

We infer the local stellar-to-halo/subhalo mass relations (MRs) for central and satellite galaxies 
separately. Our statistical method is extending the abundance matching, halo occupation distribution, 
and conditional stellar mass function formalisms. We constrain the model using several combinations 
of observational data, consisting of the total galaxy stellar mass function (GSMF), its decomposition 
into centrals and satellites, and the projected two-point correlation functions (2PCFs) measured in 
different stellar mass (Af,) bins. In addition, we use the ACDM halo and subhalo mass functions. 
The differences among the resulting MRs are within the model-fit uncertainties (which are very small, 
smaller than the intrinsic scatter between galaxy and halo mass), no matter what combination of 
data are used. This shows that matching abundances or occupational numbers is equivalent, and 
that the GSMFs and 2PCFs are tightly connected. We also constrain the values of the intrinsic 
scatter around the central-halo (CH) and satellite-subhalo (SS) MRs assuming them to be constant: 
(Tc = 0.168 ± 0.051 dex and ct^ = 0.172 ± 0.057 dex, respectively. The CH and SS MRs are actually 
different, in particular when we take the subhalo mass at the present-day epoch instead of at their 
accretion time. When using the MRs for studying the satellite population (e.g., in the Milky Way, 
MW), the SS MR should be chosen instead of the average one. Our model allows one to calculate 
several population statistics. We find that the central galaxy is not on average within the mass 
distribution of the most-massive satellite, even for cluster-sized halos, i.e., centrals are not a mere 
realization of the high-end of the satellite mass function; however for > 3 x 10"'^'^ M© halos, ~ 15% 
of centrals could be. We also find that the probabilities of MW-sized halos of having N Magellanic- 
Clouds (MCs)-sized satellites agree well with observational measures; for a halo mass of 2 x 10^^ Mq, 
the probability to have 2 MCs is 5.4%, but if we exclude those systems with satellites larger than the 
MCs, then the probability decreases to < 2.2%. 

Subject headings: galaxies: abundances — galaxies: evolution — galaxies: halos — galaxies: luminos- 
ity function, mass function — galaxies: statistics — cosmology: dark matter. 



1. INTRODUCTION 

The statistical description of the galaxy population is a 
valuable tool for understanding the properties of galax- 
ies and the way they cluster, as well as the role that 
mass and environment play in shaping these properties. 
Moreover, statistical descriptors such as the luminosity 
function, the galaxy stellar mass function (GSMF), and 
the two-point correlation function (2PCF) has allowed us 
to probe galaxy evolution and its connection to the cos- 
mological initial conditions of structure formation (e.g., 
lPeeblesl[l980l : lYoo et al.l 120091 ). Such a connection is of 
vital importance in studies devoted to the development 
of the current ACDM cosmological paradigm. A key in- 
gredient in these studies is the link between galaxy and 
dark matter halo properties. Such a link allows to project 
the theoretical dark matter halo population onto the ob- 
servable galaxies. 

Recently, progress towards connecting galaxies 
and halos has been made through the develop- 
ment of several techniques for observationally es- 
timating the dark halo masses o f luminous galax- 
ies, such as weak lensing ("Mandel baum et "al 

iMandelbau m. Scliak & Hirata 1200 
Schulz. Mandelbaum fc Padnianabhan 2010), kinemat 



l2009ll2011l:IWoitak fc Mamonll2013D, and galaxy clusters 



Yang. Mo fc van den Bo"schl l2009al : 
Yang et al.l I2012D . However, these 



ics of satellite galaxies (Conrov et al., .2007; .More et al.l 
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(lYang et all 120071: 
[Hansen et al.l 120091 : 

direct probes of halo mass still have large uncertainties. 

Consequentially, semi-empirical approaches that link 
the galaxy and dark matter halo distributions statisti- 
cally are of great importance. For example, the Halo Oc- 
cupation Distribution (HOD) formalism, which describes 
the probability for finding N galaxies in halos of mass 
Mh, has been used successfully to understand the non- 
linear relation between the distribution of galaxies and 
matter, for i n stance, at the level of the power spectra 
flSeliag [2OOOI: IPeacock fc SmithI [20001: ICoorav fc ShethI 
I200I " [Yoo et al.112009'), or the two-point correlation func- 
tionsjBerhnd & Weinberg 2002; Coorav & Shcth 2003 
'Zehavi et al. 2005; Abbas & Shcth 2006; Tinker et aL 
2008b; Zeh avi ct a l. 201 1; Wat son, Bcrli nd fc Zentner 
20111 : iWatson et al.l I2OI2I : [Wake. Franx fc van Dokkum 
2OI2I and references therein). 

However, the HOD model provides only information 
on the total number of galaxies above some luminosity or 
stellar mass threshold per halo, and constrains only the 
halo mass of the central galaxy. In order to describe the 
detailed halo occupation a nd mass distribution of cen- 
tral a nd satellite galaxies, lYang. Mo fc van den Boscbl 
(|2003| ) introduced the conditional luminosity (or stel- 
lar mass) function (GSMF) in the HOD model (see 
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also e.g . . lYang. Mo fc van den BoschI l2009al hereafter 
YMBOalMqster et al.ll2010l:lLeauthaucl et al ]l20Tll [20T2l: 



Yang et all \20lW . The CSMF is defined as the av- 
erage number of galaxies with stellar masses between 
M^, ± occupying a halo of a given mass Afh. Nev- 

ertheless, both the HOD model and the CSMF formalism 
assume a parametric description for the satellite popu- 
lation distributions which is constrained using observa- 
tions. 

In order to avoid an arbitrary parametric description 
for the satellite population, the above models can 
be generalized with the abun dance matching tech- 
nique (hereafte r, AM T; e.g.. iVale fc Ostrikeij 120041: 
Kravtsqv et alj 120041 : iConroy. Wechsler fc KravtsoyI 



20061: iSha nkar et al. ' 120061; 'Weinbe rg et all 120081: 



Baldrv. Gla zcbrook & D river 2008; Co nrov fc Wechsle: 



20091: iDrorv et al.l l200l iMoster et all 



Behroozi. Conrov fc Wechsler ! 12010: Guo et all 




Behroozi. Wechsler fc Conro v 2012; Reddick et al.l l: 
Papastergis et al.l I2012D . ^Undcr the hypothesis that 



there exists a one-to-one monotonic relation between 
stellar mass and (sub)halo mass, the matching the 
total galaxy and halo plus subhalo abundances yields 
a global (average) relation between and Mh. Note 
that in this simple procedure, the central-to-halo and 
satellite-to-subhalo mass relations (hereafter CHMR and 
SSMR, resp e ctively ) are not diff'erentiated. Recently, 
iSimha et al.l ()2012[ ) have found in their cosmological 
N-body/hydrodynamics simulations that both mass 
relations are nearly identical if the subhalo masses, 
msub, are defined at their accretion times. Additionally, 
previous studies have shown that when the AMT results 
are applied to the HOD model with mgub defined at the 
accretion epoch, then th e spatial clu stering of galaxies 
is m o stly recovered (e.g . lConrov.~W echslcr fc Kravtsov 
120061 : IMoster et al.l 120101 ) . Similar results are expected 
when msub is defined at the observation time but a 
global offset is applied to account for the average effect of 
subhalo mass loss due to tidal stripping ()Vale fc OstrikeH 
[2004; Weinberg ct al. 2008). 

On the other hand, there is no reason to assume a priori 
the S S MR to be identical to the CHMR (.Neistein et all 
I20I1I iRodn'guez-Puebla. Drorv fc Avila Reesd 120121 
hereafter IRDA12n . For accretion-time msub, such an 
assumption implies that the change of the stellar masses 
of satellites after their accretion will be such that they 
would occupy the z = central-to-halo mass relation or, 
more generally, that the CHMR almost does not change 
with time. Recent studies based on large halo-ba sed 
group catalogs (e.g., iWetzel. Tinker fc Conrovll2012D or 
on the predicted bu lge-to-total mass ratio of central 
galaxies! ) Zavala et al.l 12012) have shown that once 
satellite galaxies are accreted, they evolve roughly as a 
central galaxy at least for several Gyrs. This could imply 
that the SSMR with rusub defined at accretion time may 
not be equal to the z = CHMR. Neverthele ss, in the 
cosmological simulations of ISimha et al.l (|2012D . despite 
the fact that satellites continue to grow after accretion, 
both mass relations end up similar. It is therefore likely 
that the growth in mass as well as the change of the 
CHM R with time are very small. 

In RDA12I we extended the AMT to determine the 
CHMR and SSMR separately, using the observed decom- 
position of the GSMFs into centrals and satellites. We 



have found that indeed the SSMR is not equal to the 
CHMR, and that applying them to the HOD -I- CSMF 
model leads to satellite CSMF and correlation functions 
in excellent agreement with observational data. Actually, 
when msub is defined at the accretion time, the z — 
m ass relat ions become cl ose but n ot equal (see Fig. 2 
in lRDA12D . Additionally, IRDA12I show that the uncer- 
tainty in the AMT related to the satellite stellar mass 
growth can be avoided if subhalo masses are defined at 
the time of obse rvation rather than at the time of ac- 
cretion. IRDAI2I also suggest that the central-halo and 
satellite-subhalo mass relations can be determined simul- 
taneously using the correlation functions as observational 
input, instead of the GSMF decomposed into satellites 
and centrals. This is presumably because matching abun- 
dances of satellite to subhalos is essentially equivalent to 
matching their corresponding occupational numbers (and 
vice versa). 

In the present paper, we aim to test the above state- 
ments. We will also probe how robust the determinations 
of the central-to-halo and satellite-to-subhalo mass rela- 
tions through our extended AMT and HOD-f CSMF com- 
bined model are. We will explore whether these mass re- 
lations vary significantly depending on the combinations 
of observational data being used; in particular, we will 
explore whether the uncertainties in the model parame- 
ters that describe the mass relations shrink significantly 
when more observational constraints are added. 

Our model proves to be a powerful tool for connecting 
the ACDM (sub)halo statistics to the statistics of the 
central/satellite galaxy populations. In this sense, one 
may predict many halo occupational distributions and 
probabilities as a function of scale; for instance, the mass 
distribution of the most massive satellites as a function of 
halo mass or the probability of a halo hosting N satellites 
in a given stellar mass range or, more generally, the whole 
satellite CSMF. We will discuss some results obtained for 
these occupational distributions and probabilities. 

In Section 2 we describe our extended AMT and 
HOD+CSMF model, and present the different combina- 
tions of data to be used to constrain the model parame- 
ters. The results of our model for the different data sets 
are presented in Section 3. In particular, we compare 
the central-halo and satellite-subhalo mass relations ob- 
tained using different data sets. We also constrain the in- 
trinsic scatter around the mean central-halo and satellite- 
subhalo mass relations. In §4 we discuss the halo occu- 
pational statistics related to the halo mass dependence 
of the satellite CSMF, the stellar mass gap between the 
most massive satellite and the central galaxy, and the 
probabilities of Milky- Way (MW) sized halos having 1, 
2, or more Magellanic Cloud-sized satellites. Section 5 is 
devoted to discuss the robustness of the obtained mass 
relations and their model uncertainties, as well as the im- 
plications of extrapolating our obtained SSMR to masses 
as small as the MW dwarf spheroidal galaxies. Finally, 
we present our conclusions in Section 6. 

We adopt cosmological parameter values close to 
WMAP 7: Qa = 0.73, f^M = 0.27, /i = 0.70, = 0.98 
and (Ts = 0.84. 

2. METHODOLOGY 

In the following we present our model connecting 
galaxies to halos and subhalos via their occupational 



Galaxy-halo/subhalo connection 



3 



numbers. This is done under the assumption that on 
average the central-to-halo and satellite-to-subhalo re- 
lations are monotonic. The model relates in a self- 
consistent way the GSMF decomposed into centrals and 
satellites, the ACDM halo/subhalo mass functions, the 
satellite CSMFs, and the galaxy projected 2PCFs. As 
a result, it constrains both the CHMR and the SSMR, 
and predicts the satellite CSMF and several other occu- 
patio nal statistics. Unlike previous models of this kind 
(e.g., IMoster et al.] IMS), the CHMR and SSMR are 
treated separately. 

2.1. Connecting galaxies to halos and subhalos 

The total GSMF is decomposed into satellites and cen- 
tral galaxies, 

which after integration yields the mean cumulative num- 
ber density of galaxies with stellar masses greater than 



^dMJ = 



/"OO 

(t>g,ccndMj + / (j)g_satdMj , (2) 
J M, 



or, in short, 

ng{> M*) = ng,ccn(> M*) -|- ng,sat(> M*)- (3) 

2.1.1. Central galaxies 

For constructing the central GSMF, we will use the 
conditional probability that a given halo of mass Mh is 
inhabited by a central galaxy with stellar mass between 
± Pcon(M*|Mh)dM*, and assume this distri- 

bution to be log-normal: 



Pcc„(M,|Mh)rfAf, = 



dAL 



exp 



v/2^Af, In(lO) 
log'(M,/M,,c(Mh)) 



(4) 



with CTc being the intrinsic scatter (width), expressed in 
dex units, around log Af,.c(-^^h), the mean stellar-to-halo 
mass relation of central galaxies (CHMR). Formally, 
Pccn{Mf\Mh) maps the halo mass function, HMF, onto 
the central GSMF, thereby encoding all the physical pro- 
cesses involved in galaxy formation inside the halos. We 
parametr ize logMH.,c(Afh) using the functional form pro- 
posed bv lBehroozi. Wechsler fc Conrovl ()2Q12f ). 

logM,,c(A/h) = log(ecMi,c) + /(log(Mh/A/i,c)) - /(0(5) 



where 



fix) = <5c 



(log(l + e-))^= 



1 



,10- 



log(10"<=^ + 1). (6) 



This function behaves as power law with slope a at 
masses much smaller than Afi^c, and as a sub-power 
law with slope 7c at larger masses. This parametriza- 
tion maps the A CDM HMF to a Schechter-like GSMF 
(|Schechtedll97i) . 

The mean number density of central galaxies with stel- 
lar masses between M*±dM*/2, (i.e., the central GSMF) 



is given by 

(f>g.ccn{M,)dM, = dM, I Pco„(M,|Mh)(/)h(Mh)dMh, 



(7) 

where (/)h(A'/h) is the distinct HMF. We use the fitted 
results to the disti nct HMF from cosm ological simula- 
tions carried out in iTinker et al.l (|2008a[ ) as reported in 
their Appendix B. Here we define halo masses at the 
virial radius, i.e. the halo radius where the spherical 
overdensity is Avir times the mean matter density, with 
A^i, = (l8Tr^ + 82x -39x'^) /rt(z), and »(z) = p^(z)/pcrit 
and X = n{z) - 1 (jBrvan fc Normanlll998[) . 

Having defined Pcen{M^* l-ATh), the cumulative probabil- 
ity that a halo of mass Mh hosts a central galaxy with a 
stellar mass greater than is simply 



Pccn(M,|Afh)rfAf, 



(8) 



which coincides with the definition of the mean occupa- 
tional number of central galaxies, {Nc{> Af*|Mh)). Fi- 
nally, we are able to infer the mean number density 
of galaxies with stellar mass greater than M*, that is, 
ng,cc„(> M,) = (iVc(> M,|Mh))0h(Mh)dMh. 

2.1.2. Satellite galaxies 

Since satellites are expected to reside in subhalos, we 
will use a similar approach to centrals, i.e. we will estab- 
lish a link between the properties of satellite galaxies to 
those of the subhalos. However, in this case, one should 
take into account that (i) before becoming a satellite they 
occupy a distinct halo, and (ii) the subhalo mass, mgub, 
can be defined at the observation time (their present-day 
mass in our case) or at the accretion time (the epoch 
when a distinct halo beca me a sub halo). 

Item (u) is discussed in IRDA12I . First, IRDA12I show 
that once the subhalo mass function is provided for any 
definition of subhalo mass, the satellite-to-subhalo mass 
relation, SSMR, can be constrained consistently with the 
observed satellite GSMF. Therefore, the use of one or 
another is subject to practical criteria. On one hand, 
with the accretion-epoch definition, the central and satel- 
lite mass relations are alm ost the same, as o bservations 
|RDA12) and simulations (jSimha et al.ll201^ show, and 
the obtained SSMR for this case is free of a potential de- 
pendence on host halo mass. Besides, the accretion-time 
'7^sub definition is less sensitive to the specifics of the halo 
finding algorithm than the observed-time definition. On 
the other hand, the SSMR for the subhalo mass defined 
at accretion time is actually a nominal relation, where 
the abundance matching is carried out for the satellite 
GSMF at the present epoch but for a subhalo mass func- 
tion constructed for subhalos accreted at different pre- 
vious epochs. The physical interpretation of this nomi- 
nal relation requires assumptions about the evolution of 
galaxies. Instead, when matching present-day satellite 
abundances with present-day subhalo abundances, the 
connection is dire ct and n o assumptions about evolution 
are necessary fsee lRDA12l . §§4.1, for an extensive discus- 
sion). 

Here, our constraints for the SSMR refer to niguh de- 
fined at the same epoch that the observational input is 
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provided for, that is the present time. However, some re- 
sults will be presented also for the accretion-time rrisub- 
For the subhalo abundance, given as the subhalo con- 
ditional mass function, we use the results obtained in 



iBovlan-Kolchin et alJ ()2010[ ) based on the Millennium- 
II simulation. It is worth noting that the lowest subhalo 
masses we probe in this work (~ 10^" — lO^^Mo, depend- 
ing on the GSMF used) are around 3-4 orders of magni- 
tude above the mass resolution of this simulation. The 
present-day subhalo mass is the mass enclosed within a 
truncation radius, which is defined as the radius where 
the spherically-averaged density profile starts to fiatten 
or to increase with radius. The fitting formula for the 
mean cumulative number of subhalos with present-day 
(observed) mass mgub given a host halo mass Mh is: 



(^sub(> TOsublMi)) = ^J■0 



exp 



(9) 



{Mo,Mi,/^cut,a, = 
0.935,1.29}. Then, the 



where /i = msub/^fh and 

{;L J5(logM„-12.25)^ 0.010, 0.096, - 

number of subhalos of mass between TOsub i dmsub/2 re- 
siding in host halos of mass Mh (the SubhCMF), is sim- 
ply 



uh{msuh\Mh)dmsuh = rf(^sub(> n^sublMi))- (10) 

The average cumulative num ber of subhalos reported 
in IBovlan-Kolchin et all ()20l J). Eq. ©, was actually ob- 
tained for MW-sized halos. However, as the authors dis- 
cuss, the normalization factor, /io, has been found to vary 
with Mh, roughly 15% per dex in Mh. For this reason we 
introduce the q uantity /io = 25*^'°^*^''^^^-^^' (see also 
IGao et al.l[2nil[) . 

The difference in cosmology between the Millennium- 
II simulation and ours leads to differences in the re- 
sulting abundances of subhalos of roughly a few per 
cent in the amplitude of the subhalo mass function 
(jBovlan-Kolchi n et al.'' 20Tol ). and it has little effects on 
our results (see RDAlgj). In any case, we introduce a 
correction to first order, taking advantage of the fact 
that [Tinker et aTl (|2008bh provides the distinct HMF as 
a function of the relevant cosmological parameters. First, 
the subhalo mass function is ca l cula ted from Eqs. ([9]) 
and (Uni) and the [Tinker et all ([200 8b) HMF defined 
for the Millenium cosmology. Then, the "Millenium- 
cosmology" subhalo-to-halo mass function ratio is cal- 
culated, r(M) = 4>suhMuiM)/(l)h,MuiM). This ratio is 
now used to recalculate the subhalo mass function for our 
cosi nology as (j)sn h (M) = T{M)(t)b{M), where (j)h{M) is 
the [Tinker et aT] ()2008bf ) HMF for our cosmology. Fi- 
nally, assuming the same functional form for the subhalo 
conditional mass function (Eq. llOp . with the same /ig, 
we obtain the new parameters for our cosmology from 
fitting {/ii,/Ltcut,a,&} = {0.011,0.096,-0.935,1.342}. 
These are actually very cl ose to what is reported in 
IBovlan-Kolchin et al.l ()2010D . 

Analogously to centrals, for constructing the satellite 
GSMF we introduce the probability, Psat(-^^*|'Tisub)rf-^-^*, 
that a subhalo of mass msub hosts a satellite galaxy with 
stellar mass between M* ± (iM,/2. In general there is 
no reason for assuming Pccn(M*|Mi) = -Psat(-^*|TOsub)El- 

^ This assumption may actually lead to inconsistent results, even 



We again adopt a log-normal form, 

-Psat(A^*|TOsub)rfM* 



dM* 



exp 



y2^M, In(lO) 
log^(MH./M*,,(msub)) 



2a 



(11) 



where Ug is the scatter (width) around the logarithm in 
base 10 of M*^s(rnsub), the mean satellite-subhalo mass 
relation (SSMR). Similarly to centrals, we parametrize 
logM*^s(msub) using Eq. The reason is because, as 
observations suggest, the shape of t he satell i te GSMF is 
also a Schechter-like function (e.g., IYMB09I : lYang et"al] 
|2012| ). which is easily reproduced from the halo or sub- 
halo mass function using the parametrization given by 
Eq. ©. 

The next step is to link satellites to subhalos. The 
most natural way to do this is via their occupa tional 
numbers (e.g., lYang. Mo fc van den Bosch|[2009bl ). Let 
$sat(-M*|-^h) be the GSMF giving the mean number of 
satellites of stellar mass M* ± M*/2 residing in a host 
halo of mass Mhi 

$sat(M,|Mh)dM, = 

dM* / Psat(M*|n^sub)^'sub("isub|Mh)(imsub- (12) 
Jo 

The similarity with Eq. ^ is not a coincidence, since this 
is actually the AMT in its differential form but at the 
level of GSMFs. Integrating this over stellar mass gives 
the mean occupation of satellite galaxies in individual 
halos: 



{Ns{> M,\M^)) = ^,,tiM,\Mi,)dA'h. (13) 

At this point we are in a position to compute the satellite 
GSMF: 

(l)g,sAM.)dAh = dM, / $sat(M*|Mh)0h(A4)rfMh, 



(14) 

and in the case that as is a constant, 

4>g,sat{M^)dM^ = dAU / -Psat(M*|msub)0sub("^sub)rfTOsub, 

Jo 

(15) 

which is the matching of satellite galaxies to subhalos. 
The mean number density of satellite galaxies with stellar 
mass greater than Af* is given by: 



'^g,sat(> A/*) 



(iV,(>M,|Mh))(/.h(Afh)dMh. (16) 



for the accretion-time mgub definition, as shown in IRDA12I . For 
rrtgub defined at the present time, tidal stripping affects the masses 
of the subhalos producing this obviously a systematic offset be- 
tween the galaxy-halo and satellite-subhalo mass relations, which 
is sometimes i ncorporated as an assumed gl obal offset in the AMT 
analyses (e.g.. IVale fc Ostrikerf [2004 : Weinberg et al .lf200g'l. For 
'"sub defined at the accretion epoch, the two relations become ac- 
tually close according to the extended AM T analys is of RDA12 or 
to the results of cosmological simulations llSimha et al. 2012), but 
there may be still offsets and differences in scatter because of the 
uncerta in evolution of the satellites after accretion (see Fig. 2 in 
IRDA121 and Figs. 3 and 7 below). 
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Finally, note that the relation between Pcen{M^,\M\^ 
and Psat{M^,\msuh) with the distribution P(M^ \ M 
used in the standard AMT (e.g., iVale fc Ostrikeiil200 
iBehroozi. Conroy fc Wechsledl2010l ) is given bv 



P(M,|M) 



(t>suh{M) 

(?!>dm(M) 



Ps,t{A'U\M)- 



>dm(M) 



Pcc„(M*|M), 
(17) 



where (/>dm(M) — 0sub(A/)+0h(M) and M applies either 
to the distinct halo or subhalo masses. Then, the above 
equation relates the mass relation commonly obtained 
through the standard AMT with those obtained in this 
paper, 

(log M4M)) = m^(logAf,^,(Af)) + 



0dm(M) 
MM) 
0dm(M) 



(logA4,,(M)), (18) 



where ]Vh^^{M) and M^^^{M) are the SSMR and CHMR, 
respectively. It is worth noting that the standard AMT 
is recovered if both Pcon(-M,|Mh) and -Psat(-/Vf*|msub) 
are assumed to be 5— functions. Then ng^ccn(> -M*) + 
"■g,sat(> M^) = nsub(> Afh) + "^h(> Mh). For a detailed 
discussion see IRDA12I . 

2.2. The two-point correlation Junction 

So far, the galaxy- (sub)halo link is based on an ex- 
tended AMT. However, having modeled the occupa- 
tional numbers for central and satellite galaxies, we 
can now introduce information related to the spatial 
clustering. For convenience, we will write {N) = 
(A^(>M,|Mh)), {N,) = {N,{>M,\M^)) and (iV,) = 
{N,{> M,\M^)). 

As usual, the two-point correlation function is decom- 
posed into two parts, 

1 + egg(r) = [1 + + [1 + &)l (19) 

where £,^{t) describes pairs within the same halo (one- 
halo term), while C|g('') describes pairs occupying differ- 
ent haloes (two-halo term). 

To compute the one-halo term, we need to count 
all galaxy pairs {N{N — l))/2 separated by a distance 
r ± dr/2 within individual halos of mass Afh, following 
a pair distribution X{r)dr weighted by the abundance of 
distinct halos, i/ih, and normalized by the mean galaxy 
number density rig, 

{N{N^1)) 



2'Kr'^n'^ jQ 



-X{r)MMMm 



The contribution to the mean number of galaxy pairs 
from central-satellite pairs and satellite-satellite pairs is 
given by 



{N{N-l)) 



\{r)dr = {Nc){Ns)\cAr)dr 
, {N,{N, - 1)) 



\sAr)dr. (21) 



We assume that central-satellite pairs follow a pair distri- 
bution function Ac,s(r)(ir = ATTp^Yw{M\i,r)r'^dr, where 
PNFw(-^h,'') is the normalized NFW halo density pro- 
file. The satellite-satellite pair distribution. As, s(r)(ir, is 



then the normalized density profile convolved with itself, 
that is, As, s(r)cir = 47rANFw(-Mh, ?')''^c??', where Anfw is 
the NFW profile convolved with its elf. An analy tic ex- 
pression for ANFw(Afh, r) is given bv lSheth et ail f2001'). 
Both pnfw and Anfw depend on the halo concentration 
parameter, cnfw- N-body numerical simulations show 
that this parameter weakly anti-correlates with mass, 
Cnfw — a — 6xlogMh, though with a large scatter. 

Based on resu lts of TV-body (iKravtsov et al.|[2004[) and 
hydrodynamic (jZheng et al.l 120051 ) simulations, we will 
assume that the number of satellite-satellite pairs follow 
a Poisson distribution with mean (A^s)^ = {Ns{Ns — 1)). 
This is also supported by t he analysis based on a large 
catalo g of galaxy groups by lYang. Mo fc van den BoschI 

For the two-halo term, where r > 2Rh{Mh), all pairs 
must come from galaxies in separate halos. We compute 
the two-halo term from th e non-linear matte r correlation 
function, ,^,„(r) following (jSmith et al.ll2003[) : 



eM^bie{r)U{r). 



(22) 



wher e Cir) is the scale dependence of dark matter halo 
bias (|Tinker et al.ll2005L see their Eq. B7), and. 



bg = — b{MA{N{> M,\Mi,))MMh)dAh, (23) 

ng Jq 

is the galaxy bias with 6(Afh) being the halo bias function 
(|Sheth. Mo fc Tormenl[2Q0l . 

Once we have calculated £,gg{r), we relate it to the 
projected two-point correlation function (2PCF), Wp{rp), 

by 

Wpirp) = 2 r igg{^rl + x^)dx. (24) 



In this model, Eqs. (f20H24|) relate the observed 2PCF 
to the central and satellite occupational number distri- 
butions, which on their own are related to the central 
and satellite-(sub)halo mass relations. In consequence, 
the correlation function is related to the total GSMF and 
its decomposition into centrals and satellites. Therefore, 
since the GSMFs and PCFs are tightly connected, any 
combination of these observational constraints is not ex- 
pected to provide independent constrains on the mass re- 
lations and the occupational number distributions. How- 
ever, we expect that the uncertainties in the determina- 
tions of these functions are reduced as more observational 
constraints are introduced. We will explore this question 
in more detail in §§4.1. 

2.3. Parameters in the model 

Ultimately, our model, which in total consists of ten 
free parameters -if CTc, Cs, and the cnfw — -^h relation 
are fixed- constrains the central and satellite stellar-to- 
(sub)halo mass relations. Five parameters are to model 
the CHMR (Eq. [5]): Afi.c, fc, ccc, ^c, and "fc] and five 
more to model the SSMR (and therefore the satellite oc- 
cupational numbers): mi,s, Cs, cts, <^sj and 7s. Note that 
the success of our model relies on the ability to choose a 
parametric description of the M*-Afh and Af^-mgub re- 
lations (Eq. [SJ , such that the observed total GSMF and 
its decomposition into centrals and satellites are well- 
reproduced. As discussed previously, the main motiva- 
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TABLE 1 

Constraints 



Constraints: 




1 1 't" C /^"i" 


Satellite GSMF 


Central GSMF Total GSMF 2PCF 


A 
B 
C 
Bl 


YMB09 
X 

YMB09 
X 


YMB09 - X x'i&^"'') + x'i<f'LTn 
X YMB09 Yll x'(<^rir°') + x'(<iL) 

YMB09 - Yll x2(0™') + X^('^?;r°^) + X^KlL) 
X BGD08 Yll x'(<^Sf°°') + x'(<^bL) 


Predictions: 




Data set 


Satellite GSMF 


Central GSMF Total GSMF 2PCF sat. CSMF CHMR & SSMR 


A 
B 
C 
Bl 


X 

/ 
X 

/ 


X — / / / 
/XX/ / 
XXX/ / 
/XX/ / 



tion for the functional forms chosen here is that they are 
able to reproduce Schechter-like GSMFs accuratel y. 

Using a SDSS halo-based group catalog. IYMB09I found 
that the intrinsic scatter around the CHMR is approxi- 
mately independent of halo mass and log-normally dis- 
tributed, with a mean width of (Tc(logMh)=0.173 dex. 
This result i s also supported by stud ies of satellite-galaxy 
kinematics (iMore et al.l 1200 9'. 2011) and analysis using 
HOD models (I Yang. Mo fc van den Bosch 2003: Cooravl 
120061 : iCacciato et alll2009l : iLeauthaud et al.ll2012| ). Ad- 
ditionally, AMT results are able to reproduce the GSMF 
and the spatial clustering of galaxies simultaneously 
when using cTc =const. (e.g.. lMoster et aLll2010l:rRDA12l : 
[Reddick et al, 2012). On the other hand, the scatter cr^ 
around t he SSMR has not yet been discussed in the liter- 
ature. In lRDA12l it is assumed to be the same as for cen- 
tral galaxies, giving results consistent with the observed 
projected 2PCFs and satellite CSMFs. Having said that, 
we assume the intrinsic scatters ac and as to be indepen- 
dent of halo mass and equal to 0.173 dex. Nevertheless, 
as we have discussed, the constraints provided by the 
GSMF decomposed into centrals and satellites and the 
projected 2PCFs are not independent but rather they 
are complementary. Therefore, when using all these con- 
straints, it may be possible to leave Cc and/or as as free 
parameters. We will perform this exercise in Section 3.3. 
Finally, for the relation of the concentration parameter 
cn fw with mass, we use t he fit to numerical simulations 
bv lMunoz-Cuartas et all (|2011| ). 

2.4. Observational data sets and strategy 

A combination of the total, central, and satellite 
GSMFs, and the projected 2PCF for different Af* bins 
are necessary to constrain our model. In the following, 
we will experiment with different combinations of these 
data. We wish to understand how the stellar-(sub)halo 
mass relations vary depending on the combination of 
observational data used to constrain them. In partic- 
ular, we would like to explore whether the uncertainty in 
the model parameters drops significantly by introducing 
more observational constraints. 

The different observational data to be used for con- 
straining the model parameters are as follows: 

• The IYMB09I GSMF decomposed into central and 
satellite galaxies. These data were obtained from a 



large halo-based galaxy group catalog constructed 
in Yang et al. ( 2007) from the SDSS DR4 (they de- 
fine a central galaxy as the most massive galaxy in 
a group with the remaining galaxies being satel- 
lites). Both the central and the satellite GSMF are 
well-described by Schechter functions, with central 
galaxies being the more abundant population at 
all masses, at least above the low-mass limit of the 
sample, log(M*/MQ) = 8.4. 

• The total IBGD08I GSMF, which is weh described 
by a double Schechter function. This GSMF is 
steeper at the low-mass end than the IYMB09I 
GSMF (See also IDrorv et~all 120091) . IBGD08I have 
actually extended the GSMF to a lower limit, 
log(M,/Mo) — 7.4, by introducing a surface- 
brightness completeness correction. 

• Th e projected 2PCFs determi ned in five Af, bins 
by lYang et all (|2012l hereafter IY12I ) based on the 
SDSS DR7. 

The combinations explored to constrain the model pa- 
rameters consists of four data sets: 

Set A consists of the .YMB09, central and satellite 
GSMFs, and is used to constrain the model parameters 
of our extended AMT; in this case, the projected 2PCFs 
in va rious ma ss bins are pred icted . Set B consists of the 
toia/lYMBOl GSMF and the ¥15 projected 2PCFs, and 
this is used to constrain our full combined model; the 
GSMF decomposed into centrals and satellites is a pre- 
diction. Set Bl is similar to set B but instead of the 
IYMB09I GSMF, the lBaldrv. Glazebrook fc Drived (l2008l 
hereafter BGD08) GSMF is used. Set C consists of ah the 
available data: the YMB09 GSMF decomposed into cen- 
trals/satellites and the IY12I projected 2PCF determined 
in different Af, bins; this data set over-constrains our full 
combined model. 

We notice that fiber collisions in the SDSS data under- 
lying the group catalog may introduce an extra source of 
uncertainty when using the satellite GSMF for constrain- 
ing the parameters in sets A and C. However, this seems 
to be a small effect at most since lYMBO^ show that satel- 
lite CSMFs with a correction for fiber collisions are only 
marginally different. It is also important to highlight 
that the authors report only the diagonal elements of the 
covariance matrix for the projected 2PCFs. We expect 
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Fig. 1. — The GSMF for all, central, and satellite galaxies. For 
clarity, in each panel the central GSMF was shifted down by 1 dex, 
and the satellite GSMF by 2 dex. Upper panel: Model results for 
sets A (blue long dashed line), B (red dot-short-dashed line), and 
C (solid line), compared to the observed YMB09 GSMFs for all 
(filled circles), central (filled squares), and satellite galaxies (filled 
triangles). For sets A and C the curves are just the best joint 
fit to the data, while for set B are model predictions. The shaded 
areas correspond to the standard deviation of the 1.5 X 10® MCMC 
models for set B. Lower panel: Same as upper panel but for the set 
Bl (green dot-long-dashed line and shaded areas). The predictions 
for set B are repeated in this panel (red dot-dashed line). The 
corresponding observational total BGD08 GSMF is showed with 
solid circles and error bars. The inset shows how the central and 
the satellite GSMFs add up to give the total GSMF in the case of 
set Bl. 

that the the full covariance matrix would reduce possible 
systematic errors and extra uncertainties in some of the 
constrained parameters. As discussed in §3.1, the lack of 
the covariance matrix seems to affect the results for the 
abundance of satellite galaxies, however, these effects are 
of minor importance. 

Table [T] summarizes the different data sets presented 
above and specifies where the observables are used as 
constraints and in where they are predicted by the model. 

We use Markov Chain Monte Carlo (MCMC) methods 
for sampling the best fit parameters that maximize the 
likelihood function C oc exp(— Each MCMC chain 
consist of 1.5 x 10^ elements. See Appendix A for details 
on the full procedure. 



3. THE ANALYSIS 

In our model the central/satellite GSMFs are tightly 
connected to the projected 2PCFs in such a way that 
given the former the latter can be inferred and vice versa 
(§§3.1). This connection passes through the underlying 
halo/subhalo statistics and the stellar-to-(sub)halo mass 
relations. Therefore, the latter, together with the satel- 
lite CSMF, are predictions in all cases (§§3.2 and §4, 
respectively) . 

3.1. The GSMFs & projected 2PCFs 

Figure [T] shows the model results for the central, satel- 
lite, and total GSMFs, while Fig. [2] shows the projected 
2PCFs in different mass bins. The observational data 
are also plotted in these figures (symbols with error bars) . 
The shaded regions in the figures are the resulting model- 
fit standard deviations calculated from the 1.5 x 10^ 
MCMC models for sets B and Bl (the upper and lower 
panels of Fig. [U respectively), and for set A (Fig. [2]). 
These standard deviations are associated with the un- 
certainties in the model parameters, and are produced 
partially by the uncertainties in the observations used 
to constrain the model. For a discussion on the model 
scatter see Sect. 5.1. 

For set A, which is constrained by the IYMB09I satel- 
lite and central GSMFs (solid symbols in Fig. [1] red 
dot-dashed curves are just the joint best fits to data), 
the model predicts the projected 2PCFs (red dot-dashed 
curves with shaded areas in Fig. [5]). Both the amplitude 
and the shape of the predicted projected 2PCFs are in 
excellent agre ement with observations (crosses with er- 
ror bars, IY12D at each stellar mass bin pl otted in Fig. [2j 
This result is not surprising as shown in RDA12I . What 
is interesting, however, is that the standard deviations 
are consistent with the errors reported in the observa- 
tions. Note that the 1-halo term is the zone with the 
largest uncertainty, which arises directly from the un- 
certainty in the satellite GSMF. For set B, whi ch is 
constrained by the total ■yMBOS GSMF and the IY12I 
projected 2PCFs (black curves are just the joint best 
fits to data), the model predicts the central and satellite 
GSMFs (black curves and gray shaded regions in the up- 
per panel of Fig. [T]). The model predictions agree very 
well with observations, and therefore with set A. These 
results show that the central/ satellite GSMFs and the 
2PGFs are tightly connected in such a way that given one, 
the other can he inferred through our model. Observe that 
in Fig. [U the standard deviations are consistent with the 
error bars both for the satellite and the central GSMF. 
The former has the largest uncertainties. Therefore, the 
lack of information from the projected 2PCF covariance 
matrix seems to affect mostly the abundance of satellite 
galaxies or equivalently, the 1-halo term in the projected 
2PCF. 

Set Bl (lower panel of Fig. [IJ is similar to set B, but 
the BGD08 total GSMF is used as a constraint instead 
of lYMBOa data. Therefore, the model total GSMF and 
2PCFs (green dot-long-dashed curves in Figs. [T] and [21 
respectively) are just the joint best fits to the data, but 
the central and satellite GSMFs are predicted (green 
dot-long-dashed curves with gray s haded ar eas in the 
lower panel of Fig. [J). Note that the |BGD08| GSMF ex- 
tends to lower stellar masses. The resulting slope of the 
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Fig. 2. — Projected 2PCFs in five stellar mass bins corresponding to the data sets A, B, C, and Bl (see Table 1), and to the lYlj 
observational determinations (crosses with error bars). The lines corresponding to each set are indicated in the last panel. For set A, the 
plotted 2PCFs are predictions, while for the rest of the sets are just the joint best fits. The shaded area show the standard deviation for 
the 1.5 X 10^ MCMC models. 



satellite GSMF at the faint end (log(M,/MQ) < 9.6), is 
a ^ —1.5, which is steeper than set B, a ~ —1-2, and the 
bump around M*-^ 8 x lO^^M© in the central GSMF is 
more pronounced. A steeper total GSMF implies a ma- 
jor contribution of satellite galaxies to the total GSMF 
at low masses. 

Finally, for set C, neither the central/satellite GSMFs 
nor the projected 2PCFs at different mass bins are pre- 
dicted but rather employed to constrain the model pa- 
rameters. Therefore, the blue long-dashed curves shown 
for set C in Figs. [T] and [2] are just the joint best fits 
to the observations; they are not predictions. For this 
set, the predictions are the constraints on the stellar- 
to-(sub)halo mass relations. The question now is how 
different can these relations and their uncertainties be 
from those inferred using the other data sets. 

3.2. Mass relations 

In the upper panels of Fig. [3] we plot the central, satel- 
lite, and average stellar-to-(sub)halo mass ratios (stellar 
mass fractions, /*) as a function of the (sub)halo mass 
obtained for each data set listed in Table [TJ The stel- 
lar mass fractions are obtained directly from the corre- 
sponding mass relations. Table[2]hsts the best fit MCMC 
model parameters of these relations for each of the sets. 
We use Eq. to compute the average Recall that 
the average relation, (log M^,{M)), is conceptually what 
is commonly obtained with the standard AMT. How- 
ever, in the latter case it is not possible to distinguish 
the mass relations for centrals and satellites, and it is 
common practice to assume them equal. As shown in 
IRDA121 this assumption is not correct. 

In general, we find that the shape of the stellar frac- 
tions for both the centrals/halos and satellites/subhalos 
rises steeply at low masses, reaching a maximum and 



then declines roughly as a power law towards higher 
masses. We do not find significant differences among the 
stellar mass fractions obtained for sets A, B and C. Ob- 
serve how all of them lie well within the la uncertainty 
which is dominated by the systematic uncertainty in the 
stellar mass d etermination (~ 0.25 dex, light shade d area 
in Fig. [3| see iBehroozi. Conrov fc Wechsled IMoh . All 
these relations even lie well within the standard devia- 
tion of the MCMC models, shown as a dark gray shaded 
area in set C (the others being very similar) . In the lower 
panels of Fig. [3] we plot the the standard deviations as a 
function of (sub)halo mass for each set, which we discuss 
in detail in §5.1. 

We arrive at two important implications: (1) the very 
small standard deviations obtained for the stellar-to- 
(sub)halo mass relations implies that the assumption 
that on average there is a monotonic relation between 
galaxy and (sub)halo masses is consistent with the data; 
(2) the result that set A and set B lead to very sim- 
ilar mass relations confirms that matching abundances 
is equivalent to matchi ng occup ational numbers and vice 
versa, as suggested in IRDA121 Therefore, constraining 
the model parameters with all the observational informa- 
tion, as in set C, should lead again to the same stellar- 
to-(sub)halo mass relations as in sets A and B. Indeed, 
this is what we obtain. 

The central /*-Mh relations for sets A-C at the low 
(high) mass end scale roughly as /, oc M^-^ (/* cx 
M^^'"^). The average f^-M]^ relations are such that 
they lie above but closer to centrals, simply because they 
are the dominant population. Instead, the satellite 
TTisub relations are quite different to centrals, both in 
the amplitude and in the location of the maximum of 
The maximum shifts from log(Afh/M0)« 11.9 to 
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TABLE 2 

Fit parameters 



Central galaxies; 




Data set 


log A/i,c 


stdcv 


logec 


stdcv 




stdev 


<5c 


stdev 


7c 


stdev 


A 


11.477 


0.073 


-1.582 


0.050 


2.252 


0.461 


3.558 


0.206 


0.485 


0.044 


B 


11.480 


0.066 


-1.580 


0.038 


1.982 


0.338 


3.530 


0.198 


0.491 


0.040 


C 


11.493 


0.068 


-1.600 


0.047 


2.138 


0.417 


3.572 


0.202 


0.487 


0.043 


Bl 


11.676 


0.056 


-1.475 


0.027 


2.056 


0.110 


2.454 


0.183 


0.514 


0.047 


Satellite galaxies: 




Data set 


log mi, s 


stdev 


logEs 


stdev 




stdev 




stdcv 


7s 


stdcv 


A 


10.761 


0.069 


-0.992 


0.063 


2.469 


0.710 


3.616 


0.260 


0.435 


0.077 


B 


10.773 


0.088 


-0.951 


0.052 


2.670 


0.792 


3.612 


0.255 


0.437 


0.075 


C 


10.775 


0.064 


-0.957 


0.052 


2.474 


0.657 


3.586 


0.260 


0.423 


0.071 


Bl 


11.017 


0.90 


-0.709 


0.044 


2.322 


0.191 


1.667 


0.225 


0.993 


0.133 
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Fig. 3. — Upp er panels: From left to right, the constrained stellar mass fractions of central and satellites, and of the number-density 
average (Eq. list of both. The lines corresponding to each set are indicated inside the panels. Short-dashed curves in the second and third 
panels are the constrained mass relations when the subhalo mass is defined at its accretion time. The systematic uncertainty due to the 
uncertainty in the stellar mass determination (0.25 dex) is shown with the light-gray shaded areas. Gray dashed areas indicate the MCMC 
model-fit standard deviation in the case of set C. Lower panels: The MCMC model-fit standard deviations for each data set. The short 
dashed lines in the right and middle lower panels are the intrinsic scatters, CTc and CTs, constrained for the set C assuming them to be 
constant. The color shaded area show the standard deviations of these values. 



Iog(r7isub/M0)« 11.2. These differences are basically 
due to the fact that subhalos lose mass due to tidal 
striping (on average 60-65% of the mass since the ac- 
cretion for subh alos host ing satel lites less mas sive than 
2 X 10" M^ : IRDA12I: see alsolVale fc Qstriker..2004: 



[Weinberg et"alli2008l: iWatson. Berlind fc ZentneHl2012[ ) 
However, even when the subhalo mass at accretion time 
is used, some differences remain, showing that the as- 
sumptions about evolution made in order to construct 
the nominal SSMR for this case are roughly but not ex- 
actly obeyed. 

In Fig. [3] we plot the model results for subhalo mass 
defined at accretion time for the set C (black dashed 
curve 0. The /*-msub (or SSMR) relation now lies close 
to the central /*-Mh (or CHMR) relation. Recall that 

^ The SubhCMF for subhalo s define d at the accre- 
tion time given bv IBoyl an-Kol chin et al.l 1 12010 . see also 



Giocoli. Tormen &: van den BoschI 120081 ) has been used (see 



RDA12I for details) 



for connecting the present-day observed satellite to 
(sub)halo masses at their different accretion epochs, one 
implicitly assumes that the satellite stellar masses change 
in a way that at z = the SSMR is equal to the CHMR. 
Our ignorance about how the satellite masses evolve in- 
troduces an extra uncertainty in the determination of 
the SSMR when the subhalo mass at the accretion time is 
us eddYllilRDATl . In any case, as extensively discussed 
m IRDA12I . for one or another definition of subhalo mass, 
there is a unique but different average SSMR for which 
the satellite GSMF and CSMF, and the correlation func- 
tions are in agreement with observations. Nevertheless, 
when the SSMR is assumed to be equal to the CHMR, 
the predicted satellite GSMF and CSMF, and correlation 
functions depart from observations. They do so more 
strongly for the observation-time definition, mobs, and 
less strongly for th e accret ion-time definition, mace (see 
Figs. 1, 3, and 4 in lRDA12[ ). 

The mass relations for set Bl (green dot-dashed curves) 
are somewhat different to those of set B: at high masses 
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the central (and the average) f^,-M\i relation is steeper 
than in set B, and at low masses the satellite /*~msub 
relati on is sha llower. Recall that the massive-end of the 
total IBGD08I GSMF (set Bl) decays faster than in the 
IYMB09' GSMF (set B; see Fig. H]). Consequently, at a 
fixed Afh the CHMR for set Bl is systematically lower 
than for set B. At low masses, the GSMF in set Bl is 
steeper than in set B, causing this a steeper satellite 
GSMF and therefore a shallower decay of as msub 
decreases as compared to set B. 

3.3. Constraining the intrinsic scatter of the 
stellar-to-(sub)halo mass relations 

The results presented above are obtained under the 
assumption of lognormal intrinsic scatter around the 
CHMR and the SSMR with constant la widths of 0.173 
dex for both relations. Our model is over-constrained 
by observations in set C. Therefore, we may leave one 
or both of the intrinsic scatters as free parameters, but 
keep the assumption that they are constant, that is, in- 
dependent of (sub)halo mass. 

The results of leaving only one of CTc or free are 
very similar to leaving both free at the same time. The 
MCMC algorithm in the latter case constrains the in- 
trinsic scatters to be CTc = 0.168 ± 0.051 dex and cts = 
0.172 ± 0.057 dex. These values are surprisingly close to 
those we have assumed. These values are plotted in the 
lower panels of Fig. [31 The constrained mass relation 
parameters also remain almost the same. So, under the 
assumption of lognormal distributed and constant intrin- 
sic scatters, our results confirm previously estimated val- 
ues of the scatter for central galaxies, and predict similar 
values for the scatter around the mass relation of satellite 
galaxies. 

4. OCCUPATIONAL STATISTICS 

The mass relations we obtain in the previous section 
allow us to explore several implications that come about 
naturally within the framework of our model, in partic- 
ular the halo occupational statistics. In this section we 
study the implications for the conditional stellar mass 
functions, the mass distribution of the most massive 
satellite at a fixed halo mass, and the occurrence of 
Magellanic-Clouds (MC) sized galaxies in MW-sized ha- 
los. 



4.1. The conditional stellar mass functions 

In Fig. m we plot the resulting central galaxy mass 
probability distributions, Pccn, and the satellite CSMFs, 
<I>sat, in eight halo mass bins both for set C (dark red 
and dark gray areas, respectively) and set Bl (orange and 
light gray areas, respectively). Because the predictions 
for sets A and B are very similar to those of set C we 
do not plot them separately. In fact, what is plotted 
in Fig. 0] are the standard deviations (scatters) of the 
MCMC models for each set, which for Pccn are actually 
very small. Pcen is the probability distribution for a halo 
of a fixed mass to host a central galaxy of a given stellar 
mass (eq. S]), while $sat refers to the mean number of 
satellite galaxies residing in a host halo of a fixed mass 
(eq. [12]). We compute Pcen averaged in each [M^^jMii^] 



bin as: 



Pccn(Af*|Mh)0h(Mh)dMh 



(Pr, 



(25) 



while for satellites, the averaged GSMF is given by: 



/j^"^ $sat(M,|Mh)(/)h(Mh)dA4 



(26) 



As seen in Fig. [4l the smaller the halo mass, the larger is 
the stellar mass gap between the most common central 
galaxy and the most abundant satellites. In other words, 
on average, as smaller is the halo, the larger is the ratio 
of the central galaxy mass to the masses of the satellite 
population. 

In Fig. [4| we also s how the corresponding observational 
results by IYMB09I for centrals (crosses) and satellites 
(filled squares). The agreement between the model pre- 
dictions for set C and the observational data is remark- 
able. However, some marginal differences are observed. 
As the halo mass decreases, the width of the central prob- 
ability distribution is systematically somewhat broader, 
and therefore its amplitude is lower compared with the 
YMB09 data. This could be due to the assumption that 
the intrinsic scatter CTc is independent of Mh. There are 
some pieces of evidence that ar. slightly dep e nds on Mh 
as discussed in 'Yang. Mo fc va n den BoschI ()2009al see 
also lZheng. Co il & Zchavi 2007). 

Regarding the satellite CSMFs, the abundance of very 
massive satellites is slightly, but systematically, overes- 
timated for Mh ^ IO^^Mq. This was noted already in 
IRDA121 which suggest that a possible reason is due to 
the assumption that the intrinsic scatter as is indepen- 
dent of Mh. We have explored this possibility, and found 
that as Mh decreases, the scatter CTs does tend to zero 
in order to reproduce the observations. Such a behav- 
ior is not expected at all. Another possibility, and the 
most likely, is th at the |Y MB09 satell ite GSM F is under- 
estimated (e.g., iSkibba et al.i i20"TTI: IRDA12D . although 
the results from set B indicate that the obtained satel- 
lite GSMF is consistent with observations. In any case, 
the excess of massive satellites in low mass halos does 
not contribute significantly to the total mean density of 
galaxies. 

For set Bl, we observe that Pccn is shifted to slightly 
lower values of M* as halo mass increases when compar- 
ing with observations (and set C). This is a consequence 
of the observed trends of the M^-Mh relations between 
set Bl and G (see Fig. [3|), and it is ultimately_^ related 
to the fact that the high- mass end of the BGDOS GSMF 
decreases faster than the IYMB09I GSMF. On the other 
hand, the satellite CSMFs for set Bl are slightly steeper 
at low stellar masses to those of set C. This is a conse- 
quence of the IBGD08I GSMF being steeper than that of 
IYMB09I at low masses. Note that the uncertainty in the 
CSMFs for set Bl dramatically increases at the lowest 
masses. This is because at these masses there is no in- 
formation on the 2PCF, so that the total GSMF alone 
poorly constraints the CSMFs. Set Bl also overestimates 
the abundance of massive satellites in low mass halos. 

4.2. Probability distributions of satellites 
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Fig. 4. — Central mass probability distributions and satellite CSMFs, .Fccn 

and 'I'sat, in eight halo mass bins for set C (dark red and 
dark gray shaded areas, respectively) and set Bl (orange and light gray shaded areas, respectively). The shaded areas correspon d to the 
standard deviation of the MCMC model-fits for each set, which for Pccn are actually very thin. The observational inferences bv IYMB09I 
are plotted with crosses forPccn, and with filled squares for the satellite CSMFs. Their halo masses were converted to match our virial 
definition. 



Once we have constrained the distribution of sateUite 
galaxies, we can predict the probabihties of having N 
satelhtes of a fixed M* or in a particular M, range as a 
function of Mh. It is assumed in our model that the sec- 
ond moment of the satellite distribution follows a Poisso- 
nian distribution (see §i 32.2|) : the second moment is neces- 
sary to estimate chance probabilities for any given num- 
ber of satellites. The probability of finding N satellites 
with stellar mass above M, in a host halo of mass Afh is 
then given by: 



where for convenience 
(Af,(>M,|Mh)). 



m ' 

we redefined 



(27) 



4.2.1. The most massive satellite mass distribution 

We can use the satellite CSMF and Eq. (P71) to compute 
the mass probability distribution of the most massive 
satellite in halos of different masses. This is given by 
the followin g expression (e.g., iMilosavlievic et ahl 120061 : 
IVale fc Ostriker.200a : 



Pi(M,|Mh)rfM, 



X e 



(28) 



Note that -Pi(M,|Mh)dM, = P(> 1), where P{> 1) 
is the probability of finding at least one satellite galaxy 
more massive than Af*, P(> 1) = 1 — i-'(O). 

The results are shown in Fig. [Sj where dashed and solid 
lines are for the most massive satellite and central galaxy 
mass distributions, respectively. The latter is by assump- 
tion a lognormal function of width <Tc = 0.173 dex. The 
shaded areas indicate the 68% width of the corresponding 



distributions. As seen in Fig. [5l the mass distribution of 
the most massive satellite changes with Afh: in massive 
halos, it becomes closer to the distribution of the central 
galaxy, while in lower mass halos it tends towards small 
satellite masses compared to the central. This difference 
in masses, expressed in magnitudes, is referred in the lit- 
erature of galaxy groups/clusters as the magnitude gap. 
The behavior seen in Fig. [5] is just a consequence of the 
satellite CSMFs showed in Fig. S] 

For halos larger than ^ 1 — 3 x 10^"^ M0 , the mean and 
standard deviation of the most massive satellite mass 
distribution slightly increase and decrease with Afh, re- 
spectively, while for smaller masses, the mean value of 
7'i(Af*|Afh) strongly decreases as Afh decreases (faster 
than the central galaxy mass does) and the standard de- 
viation increases. This transition is just at the mass cor- 
responding to small classical galaxy groups. Therefore, 
our result seems to be a consequence of the fact that in 
groups, the larger the system's mass is, the smaller is the 
collision cross sections for big galaxies of close masses so 
that more of them survive. Instead, in galaxy-sized ha- 
los, due to their smaller velocity dispersions, the galaxy 
collision cross sections are large in such a way that the 
largest galaxies probably merged into one dominant cen- 
tral. Besides, the smaller the halo, the earlier most of 
its mass assembled on average; hence, the (wet) merg- 
ers of the most massive galaxies in the halo would have 
happened early. However, a fraction of the galaxy-sized 
halos, while on average dynamically old, can accrete mas- 
sive satellites late. This could partially explain the wide 
distribution of masses of the second most massive satel- 
lite in MW-sized halos. For example, as seen in Fig. [5l 
the probability for these halos to have the most massive 
satellite ^ 5 times larger than the LMC is close to the 
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Fig. 5. — Stellar mass distributions of centrals (solid line) and the most massive satellites (dashed line) in seven different host halo 
masses. Shaded areas indicate the 68% of the corresponding distributions. The central galaxy masses on average are not part of the la 
most-massive satellite distributions. For masses above ~ 3 X 10^"^ Mq, only approximately 15% of both distributions overlap. For smaller 
masses, this fraction rapidly decreases down to 3% at Mh = 10^^ Mq. The fraction of cases with overlapping distributions are expected to 
correspond to those cases where the central galaxy mass is a statistical realization of the most-massive satellite distribution. 



probability to have this satehite as massive as the LMC. 

From Fig. |4] we see that the mass of the central galaxy 
in the largest halos could be the statistical extreme of 
the satellite CSMF. The question whether the brightest 
cluster galaxies are a mere statistical extreme of the lu- 
minosity function in cluster s or they form a different class 
is a longstaiiding one (e.g.. iTremaine fc Richstond 119771 : 
iHearin et aDl2012l : Moid^Ql^. According to Fig.[5l the 
mass distributions of the most massive satellite and cen- 
tral galaxy become closer as Afh increases. However, 
quantitatively, we see that the mean of ■Pi(M, |Mh) lies 
outside of the Itr of the central galaxy mass distribu- 
tion even for a 10^^ Mq halo, i.e., the central and the 
most massive satellite galaxy, on average, are not ex- 
pected to be drawn from the same exponentially decay- 
ing mass function; this cri terion is similar to the observa - 
tional one introduced bv ITremaine fc Richstond ()1977[ ). 
For a similar conclu sion but using a different method see 
IHearin et aP (|2012| ). 

We can estimate the fraction of systems where both 
mass distributions overlap, and consider that this frac- 
tion corresponds to the cases where the most massive 
satellite and the central galaxy are drawn from the same 
distribution. For masses above ~ 3 x 10^'^ Mq, approxi- 
mately 15% of halos would have central galaxies that are 
not statistically peculiar with respect to the satellites. 
For smaller masses, this fraction rapidly decreases down 
to 3% at Mh = 10^^ M0. In conclusion, most of centrals 
seem to form a statistically different class of galaxies with 
respect to the satellites at all halo masses, with a small 
fraction of cases, up to ~ 15% in cluster-sized halos, be- 
ing the exception, that is to say the centrals in these 



cases could be a statistical realization of the high-mass 
end of the sateUite CSMF. 

In order to compare our population statistics in detail 
with observations, the systems should be selected by the 
central galaxy Af* and/or group richness instead of the 
halo mass. We will will carry out this exercise elsewhere 
by using a mock catalog based on the the distributions 
constrained with our model. 

4.2.2. The probability of Milky Way-Magellanic Clouds 

systems 

Our model results and Eq. ([^5]) can be used to com- 
pute the probability of having one, two, or N Magellanic 
Clouds (MCs) satellites in MW-sized halos. We calcu- 
late these probabilities for a range of possible MW-halo 
masses discussed in the literature: (0.7, 1, 2, 3) x 10^^ Mq. 
We use Mt.mo = 2.3 x IO^Mq and Msmc = 5-3 x lO^M© 
(j James fc Ivorvll201lD for the stellar masses of the MCs. 

Firstly, we are interested in calculating statistics that 
can be compared with o bservations. From a large SDSS 
sample, iLiu et aLl ()201lD have estimated the fraction of 
isolated galaxies with MW-like luminosities that do not 
have (A'^MC = 0) and that have A^mc = 1,2,3,4,5, or 
6 MC-sized satellites. We calculate similar probabilities 
for each of the halo masses incntioned above. In order to 
compare with iLiu et al.l (|26lll ) , we do not exclude sys- 
tems with s atellites more m assive than the LMC. The 
results from ILiu et ahl (|2011[ ) , for a search of MC-sized 
satellites up to 150 kpc around the primary, are plotted 
with crosses in Fig. [5] (from their Table lj3- Note that in 

^ The selection criteria and observational corrections for search- 



Galaxy-halo/ subhalo connection 



13 



our case satellites ar e counted i nside the host virial radius 
(~ 200 - 300 kpc). iLiu et alj poTll ) plot in their Fig. 8 
the probabilities with a search radius up to 250 kpc only 
for iVMc = 0, 1,2,3. We reproduce these measurements 
in Fig. [5] with gray symbols and error bars. 

Our predicted probabilities for set C are plotted in 
Fig. m The probabilities of MW-likc halos hosting MC- 
sized luminous satellites (but including possible larger 
satellites) increa ses with Mh- Recall that in the case of 
ILiu et all (|2011| ) the central galaxy luminosity is fixed. 
In this sense, our results suggest that this luminosity 
(Afo.ir — —21.2 ± 0.2 mag) can be associated to halos 
of different masses: for those galaxies with 1 or 2 MC- 
sized satellites, the preferred masses are w 1 — 2 x 10^^ 
Mq, while for those rarer systems with 3 to 6 MC-sized 
satellites, the preferred masses are > 2 x 10^^ M©. In- 
terestingly enough, from the inverse of the CHMR (set 
C), taking into account the intrinsic scatter around this 
relation, the halo masses corresponding to the MW stel- 
lar mass, log(M*/M0) = 10.74±0.1, are log(Mh/M0) = 
12.31 ± 0.22. Therefore, the rare halos that host 1 or 
2 MC-sized satellites are those on the low-mass side of 
the halo mass distribution given the MW central stel- 
lar mass, while the much more rarer halos hosting 3 to 
6 MC-sized satellites are those in the high-mass end of 
such a distribution. 

The probability of the concrete case of two MC-sized 
satellites (iVMc=2; but not excluding the possibility of 
satellites larger than the LMC) in a MW-like halo of 
2 X 10^2 is 5.4% for set C (see Fig. E]). If we exclude 
now from our model predictions the possibility of having 
satellites larger than the LMC (as it happens in the MW 
system), then the probability drops to a upper limit of 
2.2%. 

The statistics of finding MW-sized galaxies with satel- 
lites in the concrete mass range of the MCs is limited. 
This statistics is actually part of the more general cu- 
mulative conditional satellite mass function. Having 
this function for galaxies we may then ask, for instance, 
whether the MW is rare because it has two too massive 
satellites or because it has a deficiency of massive (larger 
than LMC) satellites with respect to the average. We 
will report results related to these questions elsewhere 
by using a mock galaxy catalog generated with the dis- 
tributions constrained here. The mock catalog will allow 
us also to infer several statistics given the central galaxy 
stellar mass in which we are interested in (e.g., the MW 
one) instead of exploring a range of possible halo masses 
as was done here. 

We conclude that the agreement between the predicted 
and observationally determined probabilities is reason- 
able within the uncertainties. Such an agreement indi- 
cates that the model is self-consist as well as consistent 
with the underlying ACDM scenario. Note that this self- 
consistency has been proven down to the scales of the MC 
galaxies and at the level of satellite population distribu- 
tions. Similar probabilities were found also using large N- 
body cosmological simulations and looking for MW-sized 
halos with subhalos that have dynamical properties simi- 

ing fo r MC-lik e satellites are ac tually quite diverse. ILiu et al] 
II 20 111 , see also IBusha et al.l 120111 ) explored the sensitivity of the 
probabilities to changes in various selection parameters and found 
that their results can be slightly different, being the most sensitiv- 
ity to the satellite search radius around the primary. 




Fig. 6. — Probability of occurrence of A^mc MC-sized satellites 
in a range of possible MW-sized host halos (different lines are for 
the different masses indicated in the pl ot) based on the results for 
set C. Observational determinations by l|Liu et al.ll2011l ) for a large 
sample of SDSS galaxies are shown with black (gray) crosses for 
distances from the host up to 150 kpc (250 kpc). The black arrows 
show how the the occurrence of MC-sized satellites change when 
the search radii goes from 150 kpc to 250 kpc from the host. 

lar to the MCs (|Bovlan-Kolchin et al.ll2010HBusha et al.l 

MB). 

5. DISCUSSION 

5.1. Robustness and model uncertainties of the 
stellar-to-(sub)halo mass relations 

The main result from Section[3]is that both the inferred 
central-halo and satellite-subhalo mass relations do not 
change for all the combinations of data sets we explored. 
In other words, these relations seem to be determined 
robustly, no matter whether only the central/satellite 
GSMFs (set A) or whether only the total GSMF and the 
2PCFs (set B) or whether all of these data (set C) are 
used. The results confirm what is expected: the GSMFs 
of central and satellites galaxies are well connected with 
the 2PCFs and both are part of a general statistical de- 
scription of the galaxy population. However, it could be 
that the uncertainties around the mass relations depend 
on the set of observables used. In particular, we expect 
that the uncertainties should be smaller when all the ob- 
servational data are used to constrain the model. 

From the results of the MCMC search over 1.5 x 10^ 
models we can identify at each (sub)halo mass the aver- 
age and its standard deviation. The average stellar 
masses for a given (sub)halo mass are indistinguishable 
to those given by the average stellar-to-(sub)halo mass 
relations constructed with the best fit parameters ob- 
tained with the MCMC method. The standard devia- 
tions can be interpreted then as the l a model-fit uncer - 
tainty around these relations (see also [More "eTalllMll) . 
This uncertainty is due to (i) the inability of the proposed 
stellar-to-(sub)halo mass relations (Eqs.[S]) to reproduce 
jointly the observational data, and (ii) the observational 
errors in these data. The dark gray areas in Figs. |3] and [7] 
correspond to the standard deviations for the set C; the 
much wider light gray areas show the scatter of 0.25 dex 
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Fig. 7. — Central-halo, satellite-subhalo, and average mass relations for sets C (solid line) and Bl (green dot-dashed line). Short-dashed 
curves in the medium and right panels are the mass relations when subhalo mass is defined at its accretion time. The systematic uncertainty 
due to stellar mass determinations is show with the light-gray shaded area. Gray dashed area indicates the standard deviation of the MCMC 
model fits in for set C. Filled circles with error bars correspond to the mass relation of central galaxies from the analysis of staked weak- 
lensing bv lMandelbaum et al -' f2006'). Orange dashed area indicates the 68% of confidence in the mass relation of central galaxies using the 
kinem atics of satellites IMor c ct al. 20113. Abundance matching results reported in Bchroozi, Wcchslcr & Conroy (2012), and Guo ct 
1120101 ) are plotted with the blue dot-short-dashed and the red long-dashed lines. For comparison we have plotted with the cyan solid lines 
in the middle and right panels the central-halo mass relations for set C. Observe how in the middle panel the SSMR for the subhalo mass 
defined at the accretion time lies above the CHMR by a factor of ~ 3, while in the right panel the nominal average mass relation at the 
accretion time is a factor of ~ 1.25 higher than the CHMR. 

attributed to the systema tic uncertainty in the determi- 
natio n of the steUar mass (jBehroozi. Conrov fc Wechsleri 

How do the model-fit uncertainties in the stellar-to- 
(sub)halo mass relations for set C compare with the other 
sets? To our surprise, the uncertainties in sets A, B, and 
Bl are as small as for set C. Actually, the uncertainties 
are smaller than the intrinsic scatter between galaxy and 
halo mass at least for halo masses larger than ~ 10^^ 
Mq. Lower panels of Fig. [3] show the MCMC standard 
deviations as a function of (sub)halo mass for all the 
sets studied here. If any, the major differences are for 
the uncertainties in the SSMR: they become large at 
large subhalo masses and are larger for set A (and Bl) 
and smaller for set C. At the smallest (sub) halo masses 
the model uncertainties for all the sets increase signifi- 
cantly (but yet below the systematic uncertainty of 0.25 
dex). This is related to the larger observational errors 
at smaller stellar masses both for the GSMFs, in par- 
ticular the one for satellites, and the projected 2PCFs. 
The small model-fit uncertainties obtained in the deter- 
mination of the central-halo and satellite-subhalo mass 
relations through our model again lead us to conclude 
that these determinations are robust. 

How do the stellar-to-(sub)halo mass relations com- 
pare with previous work? The most direct (but highly 
uncertain) methods to infer the halo masses of galax- 
ies for large samples of objects are through weak lensing 
and satellite kinematics. In the left panel of Fig. [71 we 
reproduce the results for central galaxies of (log(Af:,)) 

as a function of My, b y using stacked satellite kinemat- -■ . — — r, 

ics (|More et al.l IM . Dr. S. More kindly provided us ^i*^ those of |Guo et alj (|2010D (red lon^dashed curve) 
the data in electronic form) and of (Mh) as a func- by [ Behroozi, Wechsler fc Conroyj (1201^) (blue dot- 

tion of M, by using stacked weak lensing analysifl dashed curve). These authors obtained their relations by 

matchmg the abundances of all galaxies to abundances 



(|Mandelbaum et "all l2006t ). The latter authors inferred 
the CHMR separated into late- and early-type galaxies, 
(Afh);(M*) and (Mh)e(M*), respectively. We compute 
the average (Mh)(M*) relation for all central galaxies as 

(Mh)(M,) = /,(M,)(Mh)i(A/*) + /e(Af*)(Mh)e(Af*), 

(29) 

where //(M*) and /e(A/*) are the fraction of late- and 
early- type galaxies of stellar masses M^, i n the sam- 
ple. Note also that for the iMore et all (|2011[ ) and 
IMandelbaum et aTl (|2006D results, small corrections in 
Mh were applied in order to pass to our definition of 
virial mass, as well as in M* to be consistent with the 
Chabrier (2003) IMF a dopted here. 

As seen in Fig. [71 the IMandelbaum et al.l (|2006[) . weak- 
lensing determinations are consistent with our CHMR. 
However if one takes into account that their dependence 
of (A'/h) on Af* would be flatter at high masses in case 
it is deduced from the inverse relation (see footnote), 
then our determination for set C would be steeper. In- 
stead, the results fo r set Bl would probably b e in bet- 
ter agreemen t with IMandelbaum et all ()2006f ) at high 
masses. The iMore et al.l ( 20111 ) satellite- kinematics de- 
terminations are consistent with our results for masses 
larger than Afh '-^ 4 x 10^^ Mq, but at smaller masses 
their amplitude can be 2-3 times lower. This discrepancy 
between satellite kinema tics and other methods has been 
noted previously fe.g.. [More et all 120111 : iSkibba et al.l 
l20TlHRod7iguez-Puebla et al.ll201lD . 

In the right panels of Fig. [71 we compare the average 
stellar-to-hal o mass relat i on (E g. IT51) obtained for set C 



As widely discussed in IBehroozi, Conroy fc Wechslerl II2010I ') , 
due to the scatter, the inferences are slightly different depending 



on whether Af« is constrained as a function of Afjj or as the inverse. 
As these authors show the main difference is at the high-mass end. 
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of halos plus subhalos. In general, our average mass rela- 
tion is consistent with these previous global AMT results, 
though a direct comparison might not be fair, because we 
do not assume that the mass relations for centrals-halos 
and satellite-subhalos are the same. Also, the definition 
of subhalo mass used here (at the time of observation) 
is different to the one used in the above papers, who 
define it at the time of accretion. Hence, we also plot 
our SSMR for the subhalo mass defined at the accretion 
time in Fig. [7] (dashed curve in the central panel, set 
C) and the corresponding average mass relation (dashed 
curve in the right panel). The nominal SSMR is close 
but not equal to the present-day CHMR. We found that 
the nominal SSMR lies above the CHMR at most by a 
factor of ~ 3, while the nominal average mass relation 
is a factor of ^ 1 . 25 hig her than the CHMR, see also 
iWatson &: ConroyI ()2013[ ) . To establish the former rela- 
tion, one should know how the CHMR (at the accretion 
time, the satellite is yet a central galaxy and the sub- 
halo is a distinct halo) changes with time, and how the 
satellite mass evolved since the accretion. Assuming that 
the CHMR is the same at all epochs leads to the nominal 
SSMR to be equal to the CHMR (RDA12). The fact that 
we find both relations to be close (but not equal) implies 
then that the galaxy-halo connection changes only little 
with time. This seems to be also the situation in the cos- 
mological simulations (|Simha et al.ll2012l : iDe Rossi et al.l 
[2013). 

For set Bl, which uses the|BGD08! GSMF, the CHMR 
changes slightly its slope at low masses, while the 
SSMR becomes systematically sha llower t han in the 
case of set C. This is because the IBGD08I GSMF be- 
comes steeper at lower masses. However, when the 
density- weighted average is calculated, the slope change 
see n for the centrals is almost smea red out. For 
the iBehroozi. Wechsler fc Conrovl ()2012[ ) total (average) 
mass relation the slope change is present, presumably be- 
cause the contribution of the satellite-subhalo mass re- 
lation is not taken into account. On the other hand, if 
we use the subhalo mass at the accretion time instead 
that at the observation time, then the smearing-out of 
the slope change is less evident. 

5.2. The satellite-subhalo mass relation at the low-mass 

end 

An interesting question is how to extend the GSMFs 
and stellar-to-(sub)halo mass relations towards low 
masses, since most potential issues with the ACDM 
scenario are ha ppening at small scales. As recently 
discus sed by iBovlan-Kolchin. Bullock fc KaplinghatI 
((20T1) . the ACDM scenario can be compatible with 
the overall abundance of MW satellites, but it predicts 
subhalos that are too massive (or too concentrated) com- 
pared to dynamical observations of the brightest dwarf 
spheroidal (dSph) satellites. This can be visualized using 
the dSph stellar mass vs. subhalo maximum circular 
velocity (or mass) diagram, comparing the observations 
for the bright MW dSphs with extrapolations of total 
(centrals-|-satellites) abundance matching results to low 
masses. For a given M*, the MW dSphs have subhalo 
circular velocities (or masses) much larger (by ~ 1.5 — 2 
dex in mass) than the extrapolated AMT results. 

Our model has the advantage that it allows to con- 
strain the CHMR and the SSMR separately (Fig. ^. 



The extrapolation of the latter only is what actually 
should b e used for comparison with the MW satellites. 
IRDA12I show that if the fai nt-end extrapolation of the 
GSMF is as steep as -1.6 (IBGDOSl : IDrorv et al.l[20nl 
and is completely dominated by satellite galaxies, then 
the ACDM subhalo masses are consistent with the sub- 
halo masses of the observed MW dSphs. Here, masses are 
defined at the estimated tidal radii of the dwarf satellites 
(see Fig. 2 in IRDA12I and references the rein). By using 
our model, we are able to decompose the |BGD08| GSMF 
into satellites and centrals (set Bl; Fig. [1]). The faint- 
end slope of the satellite GSMF (down to 2.5 x 10^ 
Mq) indeed resembles the total mass function, but satel- 
lites do not dominate over centrals. Therefore, the in- 
ferred SSMR gives still too large subhalo masses (by 
0.3-0.4 dex) as compared with the tidal masses o f the 
MW dwarfs. We should note that in set Bl, the IY12I 
projected 2PCF s are used, and for stellar masses smaller 
than reported in lY12l (~ 1 x 10^ Mq), no projected 2PCF 
constraints are applied. There are some hints that the 
projected 2PCFs of galaxies at small distances (one-halo 
term, where satellites dominate) are steeper than those 
measured iniYMBOOl . especially for the smallest galaxies 
(ILi fc Whitdl2009D . If this is the case, then we can easily 
show that the satellites become more abundant in the 
GSMF and the SSMR is flatter at low masses, leading 
to a better agreement with the inferred tidal (subhalo) 
masses for the MW dSph satellites. 

5.3. Interpreting the hump of the GSMF 

Several interpretations of the shape of the to- 
tal GSMF have been offered i n the literature 
dEaldry, G lazcbrook & Driver 2008: IDrorv et al.l [20091: 
Li fc White! I2009t iBolzonella et all 120101: 'Pozzctti et"all 
20101) . In this section we will focus on interpreting the 
shape of the GSMF using arguments based on the oc- 
cupation statistics of galaxies within halos. Looking at 
Fig. [5l it becomes apparent that as halo mass increases, 
the likelihood of finding at least one satellite with a stel- 
lar mass similar to that of the central galaxy increases 
rapidly. Also, the stellar mass range covered by the satel- 
lite population is narrower and closer to that of the cen- 
tral as halo mass increases. Assuming that these features 
of the satellite population mass distribution are robust 
and have been in place since the assembly of the central, 
it follows that the central's probability of growing by ac- 
creting large (compared to itself) satellites was largest in 
high-mass halos that today occupy the bump and high- 
mass end of the mass function. 

6. CONCLUSIONS 

An statistical model that combines the AMT with the 
HOD and GSMF formalisms is presented. The model 
allows to constrain the central-halo and satellite-subhalo 
mass relations (CHMR and SSMR) separately, as well 
as the satellite CSMFs inside the halos. The ACDM 
halo mass function and subhalo conditional mass func- 
tions were used as input. From the observational point of 
view, the model works with the total GSMF and its de- 
composition into centrals and satellites, and the 2PCFs. 
Therefore, the observations used to constrain the model 
can be different combinati ons of da ta: either the cen- 
tral/sateUite GSMFs (from YMB09'; set A), or the total 
GSMF ffrom YMB09. or .BGD08.') and the lHS projected 
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2PCFs (sets B and Bl), or all the data, i.e., the GSMFs 
of centrals and satellites and the projected 2PCFs (set 
C). Our aim was to explore how sensitive are the deter- 
minations of the mass relations and their uncertainties 
to the different data set used to constrain the model, as 
well as to test the overall consistency of the observations 
with the ACDM halo/subhalo mass functions. Related to 
the latter, we explored model predictions regarding some 
satellite number distributions. The main conclusions we 
arrive at are: 

• The constrained parameters of the CHMR and SSMR 
are almost identical for all sets of data, showing that 
these relations (and therefore, also the satellite CSMFs) 
are robust with respect to what combinations of data 
are used to constrain the model. To our surprise, even 
the model-fit uncertainties in the constrained stellar-to- 
(sub)halo mass relations are very similar for the differ- 
ent combinations of data sets, including the one where 
all the data are used (set C). These uncertainties are 
smaller than the assumed intrinsic scatters (0.173 dex) 
for Mh XL 10^^ Mq, and of that order for smaller masses 
where the observational determinations of the GSMFs 
and projected 2PCFs have larger errors. 

• For set A, the projected 2PCFs are predictions, while 
for set B (and Bl), the GSMF decomposition into cen- 
trals and satellites are predictions of the model. In each 
case, these predictions agree very well with the obser- 
vations. This shows that matching central/satellite and 
(sub)halo abundances (set A) is equivalent to matching 
central/satellite and (sub)halo occupational numbers, in 
which case the 2PCFs are necessary (sets B, Bl), and 
vice versa. In both cases, the CHMR and SSMR are 
intermediate relations. The key novelty in our model 
is that both relations are constrained separately instead 
of being assumed equal. Our results show also that the 
satellite/central GSMF is tightly connected to the spatial 
clustering of the population, both at the level of the one- 
and two-halo terms, as well as with the satellite mass 
functions inside the halos. 

• For set C, neither the projected 2PCFs nor the 
GSMF decomposition are predictions, instead observa- 
tional determinations of these functions are used to con- 
strain the model. This allows us to leave the widths of the 
intrinsic scatter around the CHMR and SSMR (assumed 
independent of mass and log-normally distributed) as 
free parameters. We obtain Cc — 0.168 ± 0.051 dex and 
(Js = 0.172 ± 0.057 dex. For centrals, our result con- 
firms previous estimates, and for satellites we find that 
the intrinsic scatter is almost the same as for centrals. 

• The satellite-subhalo mass relation, where subhalo 
masses are defined at the observation time, is not equal 
to the central-halo relation. For the former, the stellar 
mass scales as Mh^'^ at the low mass-end and as Mh^'^ 
at the high- mass end (set C), while for the latter, these 
scalings go as oc Mh^'^ and Mh^ '', respectively. This 
difference is mainly due to the fact that subhalos lose 
mass (60-65%) due to tidal striping. When rrisub is de- 
fined at the accretion time, the nominal SSMR is actually 
close to the CHMR but again not equal. The SSMR lies 
above the CHMR at most by a factor of ~ 3, while the 
average mass relation is a factor of ^ 1.25 higher than 
the CHMR, implying that the CHMR likely changes lit- 



tle with time. 

• In set Bl, we use the lBGDOSi total GSMF, which ex- 
tends to masses as lowe as log(M*/MQ)=7.4. This func- 
tion is steeper at the low- M^, end an d decays faster at 
the highest masses than the 'YMBO^ GSMF. Therefore, 
the CHMR and SSMR are slightly different to those in 
set B. In particular, the lowest masses show a slight flat- 
tening as compared to results of set B. For the satellites, 
if extrapolated to even lower masses, this implies smaller 
subhalo masses for a given stellar mass than usually ob- 
tained from the standard AMT. This is diminishing the 
potential problem of too massive ACDM subhalos for the 
bright MW dSphs. 

Our model allows us to infer in a natural way any sta- 
tistical distribution for the central and satellite galaxy 
populations, as for example the satellite CSMF and the 
mass distributions and probabilities of particular sub- 
populations of satellites as a function of halo mass. The 
obtained satellite CSMFs in different halo mass bins 
agree very well with th ose inferr ed from the SDSS halo- 
based galaxy groups in lYMBOOl Moreover, we have ex- 
plored in particular two interesting statistics related to 
well-posed astronomical problems, (1) the distribution of 
the stellar mass gap between the central and the most- 
massive satellite galaxy as a function of halo mass, and 
(2) the probabilities for MW-like halos to have TVmc MC- 
sized satellites. Our conclusions regarding these ques- 
tions are: 

(1) With decreasing halo mass, the mass distribution 
of the most massive satellite as compared to the the dis- 
tribution of the central galaxy become more different and 
shifted to lower masses. This shows this that the central 
is a statistically exceptional galaxy in the halo (group). 
For masses larger than Mh ~ 3 x 10^^ M©, the differences 
become smaller but even in this case only ~ 15% of halos 
seem to have the most massive satellite statistically in- 
distinguishable from the central one, which implies that 
the latter could be a mere statistical realization of the 
massive-end of the satellite CSMF instead of realization 
of a different galaxy. 

(2) For the range of halo masses in question for the 
MW, we find that the probabilities to have A'mc MC- 
sized satellites are in good agreemen t with the obser- 
vational determinations by iLiu et aH ()2011h . A MW- 
halo mass of ;S 2 x 10^^ M© would agree better with the 
observational determinations for two MC-sized satellites 
(A'^MC = 2). When excluding the cases that satellites 
are larger than the LMC, the probabilities become even 
lower: < 2.2% for Mh = 2 x 10^^ Mq. 

We conclude that the semi-empirical results we obtain 
here, both for the central-halo and satellite-subhalo mass 
relations and their intrinsic scatters, are quite robust 
and imply full consistency of the ACDM halo and sub- 
halo populations with several statistical distributions of 
the observed populations of central and satellite galaxies 
down to K'h ^ 10^ Mq. 
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acknowledges a graduate student fellowship provided by 
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APPENDIX 
THE FITTING PROCEDURE 

From the fit to the data, we constrain tlic ten free parameters of modei by maximizing the likeiihood function 
L oc exp(— x^/2). Tabie [T] fists the different combination of observationaf constrains used in this paper. 
For each GSMF, the x^'s are defined as: 

1 ( S."- . .-chJ . V 

^ * bin T 
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where iVbin is the number of bins in the total/central/sateUite GSMF reported for each author with an ith value of 
'^ff'obs error of cr^j^g. The ith value of the total/central/satellite GSMF computed in the model is denoted as 

^5, model" 

For the lYang et"all (|2012[ ) projected 2PCFs, the is defined as: 




X (%,bi„) = Tr > . > . I I ' (A2) 

where A's,bin is the number of stellar mass bins, iVr^bin denotes the number of bins in the 2PCF, Wp'obs(^p*modoi) 
amplitude of the observed (modeled) 2PCF in the jth projected distance bin of the ith stellar mass bin. 

First, we find t he set of paramete rs, a = (ai, a„), that minimizes using the Powell's directions set method in 
multi-dimensions. iPress et all ()1992[ ). Then, the resulting set of parameters is used as the starting point to sample the 
parameter space with the MCMC method. In most of our cases n = 10. We also need to specify for each parameter 
a proposed distribution, which generates the candidate for sampling the parameter space. We assume that each 
proposed distribution is Gaussian distributed. The standard deviation for each parameter, 17(0^), is calculated from 
the covariance matrix. The covariance matrix or error matrix of a is defined as the inverse of the nxn matrix a = e~^, 
computed according to 

a,, = i^V(a). (A3) 
2 dakdai 

Therefore, the standard deviations in the parameters correspond to the square roots of the terms in its diagonal, 
i.e., a{ai) = ^/cii- We consider these numbers as our best initial guess for the diagonal covariance matrix of the 
model parameters. Then the covariance matrix for the proposed matrix is computed according to the formula given 

in (jPunklev et al.ll2005[ ): el^ — —'^eu, with n the number of parameters to be fitted. 

Using these results, we sample a first chain with 100,000 models, from which we compute the diagonal of the 
covariance matrix, e^j. If the ratio of each prior, ^/eTi, to each element of the resulting diagonal covariance matrix, 
y^e^, lies in the range 0.8 < \/eii/e^ < 1.2, then we initialize a second chain with 1.5 x 10® elements for the model 
analysis; else, we repeat the procedure j— times until the ratio of the covariances of the previous chain with the last 



one reachs the condition 0.8 < y el^^/el < 1-2, that is to say, until there is not a sufficiently significant improvement 
in the standard deviations of the model parameters. The j— covariance matrix for the proposed distribution is given by 
ef/"' = ^ tl^. Then, we run a last chain with 1.5 x 10® elements for the model analysis. This procedure usually takes 
one or two iterations. For all the chains, we find a convergence ratio in each parameter lower than 0.01 (IDunklev et al.l 
[2005.1 . 



